Job-related exhaustion risk variant in UST is associated with dementia and DNA methylation

Previous genome-wide association and replication study for job-related exhaustion indicated a risk variant, rs13219957 in the UST gene. Epidemiological studies suggest connection of stress-related conditions and dementia risk. Therefore, we first studied association of rs13219957 and register-based incident dementia using survival models in the Finnish National FINRISK study surveys (N = 26,693). The AA genotype of rs13219957 was significantly associated with 40% increased risk of all-cause dementia. Then we analysed the UST locus association with brain pathology in the Vantaa 85+ cohort and found association with tau pathology (Braak stage) but not with amyloid pathology. Finally, in the functional analyses, rs13219957 showed a highly significant association with two DNA methylation sites of UST, and UST expression. Thus, the results suggest a common risk variant for a stress-related condition and dementia. Mechanisms to mediate the connection may include differential DNA methylation and transcriptional regulation of UST.


Associations of rs13219957 with incident dementia and competing risk of death in the FIN-RISK study
Association of rs13219957 genotypes with the risk of all-cause dementia and AD was studied using two survival models in parallel, accounting for the competing risk of death, the Poisson cause-specific hazard model and Fine-Gray sub-distribution hazard model.We refer here to the fully adjusted models if not otherwise stated.
The AA genotype of rs13219957 showed association with an increased risk of all-cause dementia in both Poisson model (IRR = 1.40 Rs13219957 AA also showed a significant association with a decreased risk of competing event, death without dementia, (Poisson model, IRR = 0.71(0.58-0.88);Fine-Gray model, HR = 0.69(0.50-0.93),Table 2).When analysing rs13219957 AA against all-cause mortality, no significant association was detected (Poisson model, IRR = 0.81(0.63-1.06)).Figure 1 demonstrates the cumulative incidence patterns for the different genotype groups.The cumulative incidence of all-cause dementia as well as AD are increased in the carriers of the AA genotype after 70 years of age, and the incidence of the competing event is lower after 55 years of age, and more clearly after 75 years.No difference is seen in the all-cause mortality between the genotype groups until the age of 90 years.

Secondary analyses in the FINRISK study
In the secondary analyses, correction for the population components C1-C3 from the GWAS did not change the results when compared to the uncorrected analysis, suggesting that population stratification did not affect the results, but in the smaller sample for the GWAS data, the association of rs13219957 AA was non-significant (Supplementary Table S2).
To examine the impact of missing genotype data on our main results, analyses using the basic Poisson model for rs13219957, as shown in Table 2, were conducted again incorporating a separate category for missing genotype data.The effect sizes obtained were very similar to those in Table 2.Only the association between AA genotype of rs13219957 and all-cause dementia now included 1.00 within the confidence intervals, IRR = 1.37(1.00-1.86)(Supplementary Table S3).The rs13219957 genotypes were in accordance with Hardy-Weinberg Equilibrium (HWE) (Supplementary Table S4).
Next, we examined whether the association of rs13219957 AA with all-cause dementia was mediated by the experience of work-related stress, insomnia, or depressive mood.Covariation for insomnia did not affect the strength of the association of rs13219957 AA with dementia (Poisson model, IRR = 1.39(1.01-1.91))(Supplementary Table 5).Covariation for work-related stress and depressive mood strengthened the association (Poisson   S5).Workrelated stress and the AA genotype of rs13219957 did not show statistically significant interaction (interaction terms: Poisson model, IRR = 2.09(0.69-6.34)).

Fine mapping of the UST locus
We performed variant fine mapping of the genetic loci in the FINRISK Study surveys.By considering genetic linkage disequilibrium around rs13219957, we observed a haplotype block of 16 kb (bp 148,936,606-148,953,102; rs10457837-rs2500512).The minor allele A of rs13219957 (allele frequency 0.147) was included in only one haplotype, while the major allele G occurred on nine different haplotypes.In the haploblock area of rs13219957, 18 SNPs showed r 2 > 0.2 with rs13219957.Of those variants, 10 demonstrated significant associations with allcause dementia in the Poisson model using the GWAS data of the FINRISK Study (Table 3).The effect size was the highest for the original SNP rs13219957.

UST locus and post-mortem neuropathology in the Vantaa 85+ sample
The 10 dementia-associated SNPs of the UST locus and rs13219957 were analysed for association with AD neuropathology.All the SNPs, except rs13219957, showed nominally significant (P < 0.01) association with the BRAAK staging score assessing the spread of neurofibrillary tangle tau pathology.No association was found with beta-amyloid load, clinical AD, or all-cause dementia (all P > 0.5).The results in the Vantaa 85 + cohort are presented in Table 4 and in Supplementary Figs.S1-S3.The risk alleles of the SNPs showing association with the BRAAK stage highlight three primary risk haplotypes, as shown in Fig. 2.  www.nature.com/scientificreports/

Association with DNA methylation, UST brain expression, and open chromatin region
To examine the potential functional mechanisms behind the genetic associations, we studied association of rs13219957 and the 10 other SNPs (marked in Fig. 2) with DNA methylation levels in the UST gene.DNA methylation was analysed in peripheral blood leukocytes in 454 individuals with non-missing covariates from the FINRISK Study survey of 2007.The basic characteristics of individuals for whom methylation data were either available or unavailable are reported in Supplementary Table S6.Associations of the 11 SNPs were calculated for all the available 92 CpG methylation sites in the UST gene area and UTRs.The rs13219957 minor allele showed a highly significant association with lower methylation levels at cg23982392 (P FDR = 1.1 × 10 -8 ) when multiple testing was corrected for and a less significant association with higher levels of cg062593339 (P FDR = 0.012).www.nature.com/scientificreports/ The 10 other SNPs also showed association with lower methylation at cg23982392, but weaker than that of rs13219957 (Table 5).The CpG site with the strongest association with the SNPs, cg23982392, is situated on top of a SNP rs75776863, which changes cytosine to thymine (T).The rare minor allele T of rs75776863, with frequency of 0.046, destroys the CpG site for DNA methylation.Our dataset includes two subjects with homozygosity for rs75776863 TT.Both had very low methylation M-values as supposed, and heterozygotes for rs75776863 had average methylation M-values (Supplementary Fig. S4).When analyzing LD, rs75776863 and rs13219957 showed R 2 0.081 and D' 0.54.The TT homozygotes for rs75776863 were heterozygotes for rs13219957.Supplementary Fig. S4 demonstrates the LD between rs13219957 and rs75776863 and connection with cg23982392 methylation status.
The survival analyses for all-cause dementia using the FINRISK data were performed post hoc for rs75776863 heterozygotes (Supplementary Table S7).The analyses showed significant association with all-cause dementia in the basic model (IRR = 1.22(1.03-1.44))but not in the fully adjusted model (1.17 (0.98-1.41)).The survival analyses were not possible for the TT homozygotes (N = 52) who were all dementia-free.When excluding TT homozygotes and heterozygotes for rs75776863, the strength of association of rs13219957 with all-cause dementia was not weakened suggesting also an independent effect of rs13219957 (Supplementary Table S8 vs. Supplementary Table S2).
The brain expression quantitative trait loci (eQTL) analyses using publicly available datasets revealed significant associations (P < 0.05) of the UST locus SNPs, excluding rs13219957, with lower UST brain expression in the ROSMAP 19 and GTEx project data 20 .Borderline significant associations (P = 0.052) with the same direction of effect were observed in the AD patients but not in the non-AD patients in the eGWAS Mayo dataset 21 .No significant eQTL associations were detected in the microglial cell data 22 .Nevertheless, in microglial cells, the risk alleles of the UST locus SNPs (apart from rs13219957) demonstrated borderline significant negative associations (P = 0.032-0.055)with the open chromatin region (OCR) in the UST promoter region.The eQTL findings are presented in Supplementary Tables S9 and S10, and the OCR results in Supplementary Table S11.

Discussion
This study revealed an association of rs13219957, a previously reported genetic risk variant for job-related exhaustion, with all-cause dementia in the large Finnish epidemiological FINRISK Study.No significant association with clinically defined AD was found, but association of the genetic locus with tau pathology of AD was detected in the smaller Vantaa 85+ cohort (though not with rs13219957).The strong association of rs13219957 with DNA methylation sites of the UST gene suggests epigenetic regulation of UST as a potential mechanism.Previously, UST has not been listed as a risk gene for dementia or AD pathology.In contrast to the large multi-population studies, our Finns-only study may have identified specific genetic variants.Genetic homogeneity results in a reduced number of risk alleles (less allelic heterogeneity) and higher LD with longer haplotype blocks 23 .On the other hand, the use of survival models modelling competing risk of death is likely to increase our power when compared with multi-national cross-sectional analysis, and some weaker genetic signals may emerge.Summary statistics from the large GWAS for AD and AD-proxy 24 show a weak but nominally significant association of rs13219957 with AD (P = 0.041, OR = 1.01(1.00-1.01),n = 452,010).That result, together with the association of the haplotype block with tau pathology, suggest that rs13219957 may also be a weak risk factor for AD and not only for all-cause dementia.
The strongest association signal in our study was observed, however, from the non-AD dementia cases.The mean age of onset of the dementia was 63.9 years, which indicates that among these are the early-onset cases (Table 1).Based on the prevalence of different dementias at this age, this non-AD group may consist of AD without adequate diagnostics or atypical symptoms, cases of dementia with Lewy bodies, vascular dementia, frontotemporal dementia, and alcohol-related dementia 25 .Tauopathies and hippocampal sclerosis are so rare that these are unlikely to contribute that signal, but unfortunately more specific diagnostics was not possible in our cohort.
Rs13219957 was significantly associated with methylation levels at two methylation sites, The association of the risk allele of rs13219957 with decreased methylation of cg23982392 is likely to be explained by weak LD with rare rs75776863, which can directly regulate possibility for methylation at that CpG site.However, our post hoc analyses suggested that rs75776863 would not explain the association of rs13219957 with dementia (Supplementary Table S4).
cg23982392 is situated only 26 bp from the binding site of CCCTC-binding factor (CTCF) in the intron of UST.Methylation in the CTCF binding site effectively inhibits binding of this multi-functional protein 26 , which functions for example as an insulator, chromatin remodeller, transcription factor and architectural protein 27 .Intronic CTCFs have been shown to control gene expression and regulate alternative splicing.Association of the UST locus SNPs with lower UST brain expression was, indeed, seen in two open datasets (Supplementary Table S8) suggesting that the association with dementia may be mediated by differential UST expression.Similar to the findings from the neuropathological analyses, the association signal was detected for other SNPs but not rs13219957.In line with the direction of the effect in the expression analyses, negative association with OCR in the promoter region of UST was seen for some of the UST locus SNPs (Supplementary Table S10).Interestingly, the same study highlighted UST as a gene with most significant association with cognitive resilience, lower UST expression correlating with greater cognitive decline.
UST functions as an enzyme adding sulfate groups to the uronyl residues of dermatan sulfate and, more weakly, to iduronyl residues of chondroitin sulfate 28 .Dermatan sulfate and chondroitin sulfate are intermediately sulfated glycosaminoglycans.In vitro, dermatan sulfate increases AD-like tau changes by stimulating tau phosphorylation, which is an early event in the formation of neurofibrillary tangles.The amount of sulfate groups in glycosaminoglycans correlates with higher amount of tau changes 29 .However, most of the evidence www.nature.com/scientificreports/comes from the heparin sulfate 30 .We discovered association of the UST locus (except rs13219957) with Braak stage (Table 4) suggesting that this locus has an effect on tau pathology.However, the association of the UST locus risk alleles with lower UST expression conflicts with previous functional evidence suggesting that a higher amount of sulfation is linked to a higher amount of tau pathology.Thus, the molecular mechanisms behind these associations remain to be clarified.Since mortality due to other causes is a competing event for dementia, we demonstrated here, in parallel, two survival models accounting for competing risk of death, Poisson and Fine-Gray, as recommended in 31 .Sometimes differences can be detected between aetiological risk factors, emphasised by the cause-specific hazard model, and factors that affect real cumulative incidence in the presence of competing events, emphasised by the Fine-Gray sub-distribution hazard model.In our study, associations of rs13219957 with all-cause dementia were significant in both fully adjusted models, Fine-Gray showing a slightly stronger association (Table 2).
In addition to all-cause dementia, we found association of rs13219957 with the competing risk of death (Table 2).Since no association of rs13219957 with all-cause mortality (which included cases of dementia) was detected, it is not likely to have a causal relationship with mortality, but the detected association is likely to be biased and caused by the competing effect of dementia diagnoses in our models 32 .
Individual differences in proneness to stress, assessed by the temperament trait neuroticism, have been linked to the risk of AD in several studies 9,12,13 , and no studies without significant association have been published to our knowledge.Previously, only our previous study of an MTNR1A variant has suggested genetic connection between proneness to stress (job-related exhaustion in shift work) and AD 17,18 .Here, the analyses adjusted for work-related stress suggested, however, that the association of rs13219957 is not mediated by increased susceptibility to experiencing work stress (Supplementary Table S4), but rs13219957 may show a separate risk effect for both conditions.Further studies, using for example polygenic risk scores, are needed to examine potential shared genetic risk factors for job-related exhaustion and dementia on a genome-wide scale.
This study has some limitations.First of all, the DNA methylation analyses were performed in blood leukocytes, while the more relevant cells for dementia and AD would be cortical or hippocampal neurons and glia.
Another limitation is the lack of disease diagnoses other than AD, which prevented studying the association of rs13219957 with specific diagnoses of non-AD dementia.We anticipated that the Finnish healthcare registers did not provide reliable enough clinical diagnoses for the most common types of dementia diseases after AD-vascular dementia and dementia with Lewy bodies-because no specific medications are available for either dementia type.For AD, the data on medication and special reimbursement was the most important source of diagnoses 1 .In addition, the clinical differentiation of different types of dementia (including AD) is challenging because of the commonness of the mixed pathologies for AD and vascular dementia and AD and dementia with Lewy bodies 33 .
Thirdly, the findings of this study cannot be generalized to the Finnish population due to selective participation overall 34 and specifically in this genetic study (Supplementary Table S1).However, consistent results in the analyses that include the missing rs13219957 genotype as a separate category, along with genotype frequencies following HWE (Supplementary Table S4), suggest that the missing genotype data is unlikely to influence our results.
This large prospective study with careful statistical modelling suggests that a previously reported genetic risk variant for job-related exhaustion in middle-age, rs13219957 in UST, was associated with the risk of all-cause dementia.The association may be mediated by differential DNA methylation, transcriptional regulation of UST, and induction of tau pathology of AD.Association of rs13219957 with different types of dementia requires further studies.

Method Participants
The national FINRISK study The National FINRISK Study has examined the health and risk factors of the Finnish population.New participants were recruited every five years from 1972 to 2012 34 .Study subjects for this genetic study were derived from the National FINRISK Study surveys of 1992, 1997, 2002 and 2007.At baseline, the participants were 25-74 years old.For our analyses, the separate surveys were pooled together.Characteristics of our data by genotype groups are presented in Table 1.The study characteristics by survey are presented previously 34 .
The ethical committee of the Finnish Institute of Health and Welfare and/or the Coordinating Ethical Committee of Helsinki and Uusimaa Hospital District approved the study protocols for the FINRISK surveys 1992-2007.

Vantaa 85+ study
The Vantaa 85+ is a Finnish population-based cohort of the oldest old consisting of the over 85-year-old population of the Finnish city of Vantaa in 1991.All 553 participants were followed up until death with clinical examinations or via healthcare records.Of those, 534 gave their sample for the genetic study 18 .Of the Vantaa 85+ cohort, 304 individuals underwent post-mortem examination for neuropathology of AD 35 .The maximum density of beta-amyloid plaques was assessed with a CERAD (Consortium to Establish a Registry for Alzheimer's disease) score 36 , and the spread of neurofibrillary tangles using BRAAK staging 37 .
The study protocols obtained permissions from the Local ethical committees.The Ministry of Social Affairs and Health granted permission to use the health and social care notes for the Vantaa 85+ study participants and the Center for Medico-Legal Affairs to collect and use autopsy tissue.All participants, and their relatives in case of dementia, gave their informed consent.www.nature.com/scientificreports/Follow-up and events The individuals were followed up from baseline to the first event, whichever occurred first: (1) diagnosis of dementia, (2) death without dementia, and (3) censoring at the end of the follow-up (end of 2017).Individuals with prevalent dementia were excluded (n = 26).Thus, depending on the FINRISK Study survey (1992, 1997,  2002 or 2007) the length of the follow-up was 10, 15, 20 or 25 years, respectively.The diagnoses of dementia and AD were derived from the Finnish national registers which were linked to the FINRISK Study: the Causes of Death Register, the Hospital Discharge Register and the Drug Reimbursement Register and information from prescribed and purchased dementia drugs from the Social Insurance Institution.Information from the registers was combined for this study.The diagnosis of AD included, in 1992-1995, code 3310 ICD-9 (International Classification of Diseases) and from 1996 onwards, F00 and G30 (ICD-10).All-cause dementia was diagnosed if the individual fulfilled the criteria for AD or if they had codes 3310, 4378A, 290 (ICD-9) or from 1996 onwards F00, F01, F02, F03, G30 (ICD-10).In the Drug Reimbursement Register, AD was diagnosed based on the reimbursement for AD medication (Social Insurance Institution code 307 for ATC class N06D, anti-dementia drugs: donepezil, rivastigmine, memantine and galantamine).Cases of all-cause dementia but without AD (non-AD dementia) who did not die during the follow-up were censored in the analyses of AD.The definition of the events in the FINRISK Study is described in detail previously 1 .

Genotyping
Genotyping in the FINRISK study In the FINRISK Study surveys, genotyping for rs13219957 were performed using iPLEX Gold chemistry (Sequenom, San Diego, CA, USA) as described previously 16 .After quality control (QC), the genotype was available for 26,693 individuals.Genotypes for rs13219957 and the rs13219957-linked variants were also derived from GWAS genotyping.The genotyping was performed with Illumina beadchips (Human610-Quad, HumanOmniExpress, HumanCoreExome) at the Wellcome Trust Sanger Institute (Cambridge, UK), at the Broad Institute of Harvard and MIT (Boston, MA, USA), and at the Institute for Molecular Medicine Finland (FIMM) Genotyping Unit (Helsinki, Finland).Imputation was performed with SHAPEIT2 [23269371] and IMPUTEv2 [19543373] using a custom haplotype set of 2000 whole-genome-sequenced (WGS) Finnish individuals and 1000 Genomes project phase-3 haplotypes as reference panels.After QC for rs13219957, the number of individuals with genotypes was 24,028.Since, in the HumanCoreExome chip, rs13219957 was not directly genotyped but the genotype was estimated by imputation, we performed our main analyses with the iPLEX genotypes, which also offered a larger sample size, and used GWAS genotypes for fine mapping of the haplotype block area.Of the 22,869 individuals interrogated with different genotyping methods, 57 (0.25%) received different genotypes with the different methods.Principal component analysis (PCA) for population stratification was calculated with PLINK 1.9.The top three principal components (C1-C3) were used to adjust the effect estimates for population stratification.

Genotyping in Vantaa 85+ study
In the Vantaa 85+, genotyping was performed with Illumina Infinium Human370 BeadChips (Illumina, San Diego, CA, USA) as previously described 38 .Chip genotyped samples were merged with a larger dataset originating from a similar genotyping approach and pre-phased with Eagle 2.3.5 (https:// data.broad insti tute.org/ alkes group/ Eagle/) with the default parameters, except the number of conditioning haplotypes was set to 20,000.
Genotype imputation was carried out by using the population-specific SISu v3 imputation reference panel with Beagle 4.1 (version 27Jan18.7e1,https:// facul ty.washi ngton.edu/ brown ing/ beagle/ b4_1.html) as described in the following protocol: https:// doi.org/ 10. 17504/ proto cols.io.nmndc 5e.High-coverage (25-30×) WGS data used to develop the SISu v3 reference panel were generated at the Broad Institute of Harvard and MIT and at the McDonnell Genome Institute at Washington University, and jointly processed at the Broad Institute.The variant callset was produced with the GATK HaplotypeCaller algorithm by following GATK best-practices for variant calling.Genotype-, sample-and variant-wise QC was applied in an iterative manner by using the Hail framework (https:// github.com/ hail-is/ hail) v0.1 and the resulting high-quality WGS data for 3775 individuals were phased with Eagle 2.3.5, as described above.
In post-imputation QC, variants within+-100kB of the UST locus were selected, and variants with an imputation INFO (information) score < 0.3 or a minor allele frequency < 0.01 were removed.

Covariates
In the survival analyses of the FINRISK Study, the basic model was adjusted for sex, educational classes (sexspecific and birth-cohort specific tertiles of schooling years), and the FINRISK Study survey year.The follow-up time (> 16 years/ < 16 years) and age at the end of follow-up (5 years time slots) were accounted for in the Lexis data frame in the Poisson model.In the fully adjusted model, we added covariates of systolic blood pressure (average of one to three measurements), body mass index (BMI), total cholesterol, smoking (non-/current/exsmoker), physical activity in three classes (combined occupational, leisure and commuting activity as described in 39,40 ), and diabetes.
Work-related stress, as well as insomnia, were used as covariates in the secondary analyses.One question assessed work-related stress in the FINRISK Study surveys of 1992-2002: "How often are you troubled by having to stretch your strength to the extreme to be able to cope with your present work or workload?".The response options were: 1 = almost all the time, 2 = quite often, 3 = sometimes, 4 = seldom, 5 = never, 6 = I do not work.We grouped answers 1-2 (high stress) and 3-5 (low stress) together and reversed the scale.Answer 6, I do not work, was excluded.Insomnia was assessed in the FINRISK Study surveys of 1992-2007 with the question: "Do you suffer from insomnia?": 1 = often, 2 = sometimes, 3 = not at all.

Statistical analyses
We considered death and incidence of dementia as competing risks and fitted two alternative models: Poisson Cause-specific hazard model and Fine-Gray sub-distribution hazard model 31 .In multiplicative Poisson models, death or incidence of disease was modelled using the log-link function and log length of follow-up time as an offset.Incidence rate ratios (IRRs) of that model are very similar to the hazard rates (HRs) of Cox's proportional hazards model, but without the proportionality assumption.The Poisson cause-specific hazard model has been suggested to emphasise aetiological risk factors 41 , while the Fine and Gray sub-distribution model emphasises prognostic risk.
For the Poisson model, we used Lexis data frame to include two timescales in the analysis: age and the time since start of follow-up.Cut points for age were 65, 70, 75, 80, and 85 years, and for follow-up time 16 years.R software (version 3.6.0,R Foundation for Statistical Computing, Vienna, Austria.https:// www.R-proje ct.org/.)package Epi (version 2.40) was used for the Lexis data frame and packages cmprsk (version 2.2-9) and survival (version 2.44-1.1)for survival analyses.
In the secondary analyses the Poisson model analyses were additionally adjusted for insomnia, work-related stress or the top three principal components of GWAS data (C1-C3).
All the methods were carried out in accordance with relevant guidelines and regulations.

Fine mapping
To fine map the associations of the genetic area of rs13219957, we first examined its haplotype structure in the FINRISK Study surveys using the imputed GWAS data.Haplotypes were defined using the confidence interval algorithm in the Haploview programme 42 .In the haplotype block area, we selected all the genetic variants in linkage disequilibrium (LD), r 2 > 0.20, with rs13219957 using PLINK (version 1.90b4.1) 43.Those variants were studied against all-cause dementia and AD in the FINRISK Study surveys using fully adjusted Poisson model.

Neuropathological analyses
Rs13219957 and the fine mapped variants with significant associations with all-cause dementia in the FINRISK Study surveys were studied further with neuropathological and clinical measures in the Vantaa 85+ using linear or logistic regression analyses with sex as a covariate.

Analyses of DNA methylation analyses, UST expression, and open chromatin region
Methylation analyses were performed in the DILGOM subset of the FINRISK Study survey of 2007 (n = 454 with non-missing covariates) using the HumanMethylation450K BeadChip (Illumina, Inc., San Diego, CA, USA).The preprocessing, QC, and of the data were performed using the R package minfi 44 .After the background correction, methylation values were normalised using the subset quantile normalisation method (SWAN) 45 .After the standard QC procedures, we removed the following probes: (a) probes with detection P value > 0.01; (b) crossreactive probes and probes containing a single nucleotide polymorphism (SNP), as described in 46 .We extracted all the available methylation sites in the UST gene area and untranslated regions (UTRs), which numbered 92.The proportions of white blood cells were calculated by the Houseman method 47 .For the subsequent association analyses, we used M-values (logit-transformed beta values).For testing the associations of the genetic variants with methylation M-values of the CpGs, we used a multiple linear regression model adjusted for age, sex, smoking status, alcohol consumption, cell counts, and technical variables array and slide.The P values were adjusted to control the false discovery rate (FDR) in multiple hypothesis testing using the Benjamini-Hochberg procedure, and the FDR-adjusted P value < 0.05 was used to assess statistical significance.
Analyses examining the associations between UST locus SNPs and UST brain expression (eQTL) were conducted using publicly available data from the ROSMAP project 19 (dorsolateral prefrontal cortex) and the GTEx project portal 20 (the corresponding tissue, brain cortex) as well as the eGWAS Mayo project (temporal cortex) 21 including patients with definitive AD, and non-AD individuals (mostly with other brain pathologies) 17 .
Additionally, in a dataset from the study on human microglial cells by Kosoy et al. 22 , we explored microglial cell eQTLs and the association of the UST locus SNPs with the open chromatin region (OCR) in the UST promoter region (Peak_174460) assessed using ATACseq method.

Figure
Figure 1.Cumulative incidence of all-cause dementia (A), Alzheimer's disease (B), competing event, death without dementia (C), and mortality (D) by rs13219957 genotypes in the FINRISK Study surveys.

Table 1 .
Baseline characteristics and incident dementia and mortality of the study population according to rs13219957 genotype.SD standard deviation, AD Alzheimer's disease, BMI body mass index.a One-way ANOVA, equal variances and means.b Pearson Chi-Square test.

Table 2 .
Association of rs13219957 with incident all-cause dementia, Alzheimer's disease (AD), non-AD dementia, and the competing risk of death.*Model adjusted for FINRISK survey year, sex, educational class.**Model adjusted for FINRISK survey year, sex, educational class, body mass index, systolic blood pressure, total cholesterol, smoking, physical activity, and diabetes.Follow-up time and age at the end of the follow-up were accounted for in the Lexis data frame.***Model adjusted for sex and educational class.
****Model adjusted for sex, educational class, body mass index, systolic blood pressure, total cholesterol, smoking, physical activity, and diabetes.AD alzheimer's disease, IRR incidence rate ratio.

Table 3 .
Associations of the rs13219957-linked variants (r 2 > 0.2 in the same haploblock area) with incident all-cause dementia and Alzheimer's disease in the fully adjusted Poisson cause-specific hazard model.*Model adjusted for FINRISK survey year, sex, educational class, body mass index, systolic blood pressure, total cholesterol, smoking, physical activity, and diabetes.Follow-up time and age at the end of the follow-up were accounted for in the Lexis data frame.Bold values represent significant associations at the CI 95% level.

Table 4 .
Association of the SNPs in linkage disequilibrium with rs13219957 with neuropathology of Alzheimer's disease and clinical dementias in Vantaa 85+.Linear regression analyses with sex as a covariate performed for measures of tau pathology (BRAAK stage) and beta-amyloid pathology (CERAD score), and logistic regression analyses with sex as a covariate for the diagnoses of dementia and Alzheimer's disease.SNP single nucleotide polymorphism, OR odds ratio, BP base pair, A1 minor allele, R2 square correlation measure for linkage disequilibrium, MAF minor allele frequency, CI confidence interval, CERAD consortium to establish a registry for Alzheimer's disease.

Table 5 .
Significant associations of rs13219957 and the neuropathology-linked SNPs with DNA methylation levels in the UST gene.Linear regression model was performed with sex, age, smoking status, alcohol consumption, array, slide and cell populations as covariates for 92 CpG sites and the SNPs (N = 454).Cg23982392, cg06259339 are situated in the body of UST.UST uronyl-2-sulfotransferase gene, FDR false discovery rate, NS non-significant, SNP single nucleotide polymorphism.